1. Introduction. Consider an N component system, of known structure,
that is maintained by a limited number of R repairmen, i.e., R < N. The
objective  is  to  characterize  dynamic  maintenance  policies  that  yield  a  maximum 
value  to  a  system  measure  of  performance  such  as  the  expected  discounted 
system  operation  time  and  the  average  expected  system  operation  time,  or 
availability  of  the  system.  We  make  the  following  assumptions.  Each 
component  and  the  system  as  a  whole  can  be  in  only  two  states,  functioning 
or failed. The functioning and repair times for the i-th component are
exponentially distributed random variables with known parameters $\mu_i$ and $\lambda_i$.
Components are independent, i.e., failure or repair of one has no effect on the
others. At most one repairman may be assigned to a failed component and it
is possible to reassign a repairman from one failed component to another
instantaneously. Repaired components are as good as new and failures may
take place even while the system is not functioning.

Since the number of available repairmen is less than the number of
components, the performance of the system will depend on the maintenance
policy employed, i.e., the rule for choosing the failed components to which
repairmen are assigned whenever the number of failed components is
greater than R. Under these assumptions optimal policies can be obtained,
in principle, using methods from Markovian decision theory. However, the
computational difficulties are prohibitive due to the very large number of
possible states. Therefore, explicit solutions and approximations can provide
valuable insight. An explicit solution has been obtained for the series system
with N components maintained by a single repairman in Katehakis and
Derman (1984); see also Derman et al. (1978), Nash and Weber (1982) and Smith
(1978).


In  practice  many  systems  are  composed  of  highly  reliable  components.  We 
have a simple model for such systems if we assume that the failure rate for
the i-th component is of the form $\rho\mu_i$, $1 \le i \le N$. Thus, for small values
of $\rho$, all components are highly reliable. We obtain analytical characterizations
and derive simple recursive formulas for the determination of policies
that are optimal for small values of $\rho$. These asymptotically optimal policies
can  be  used  as  good  approximations  to  optimal  policies  for  systems  composed 
of  highly  reliable  components. 

The first study of such a model was done in Smith (1978). We extend the
work of Smith in the following directions. i) We provide a formulation of the
problem along the lines of Markovian decision theory. ii) We treat the
multirepairmen case. iii) We note that the recursive formulas for the
determination of asymptotically optimal policies essentially constitute a Gauss
successive approximations method for solving the general Markovian decision
problem. iv) We establish the existence of intervals of the form $(0, \rho^*)$ with
the property that if they contain the failure rates of all components, then the
asymptotically optimal policies under consideration are optimal. v) In the
final section we find asymptotically optimal policies for the series system, the
parallel system, and a system composed of parallel subsystems connected in
series. For the series system maintained by R, $R \ge 2$, repairmen it is asymptotically
optimal to assign repairmen to failed components with the longest expected
repair times first. Thus, we show that the series system result established in
Katehakis and Derman (1984) does not hold in the case of more than one
repairman.

2. Problem Formulation. Under the assumptions made, at any time the status
of all components is given by a vector $x = (x_1, \ldots, x_N)$ with $x_i = 1$ or $0$ if
the i-th component is functioning or failed. Thus $S = \{0,1\}^N$ is the set of
all possible states. The structure of the system, i.e., the relation between the
status of the components and that of the system, is given by a partition of
the state space S into two sets G and B of "good" and "bad" states,
where if $x \in G$ the system is functioning and if $x \in B$ the system is failed.
Alternatively, this relation can be specified by the structure function $\phi$
defined on S, such that $\phi(x) = 1$ or $0$ if $x \in G$ or $x \in B$.

It  is  easy  to  show,  using  well  known  results  of  Markovian  decision  theory, 
that  the  only  relevant  decision  epochs  are  component  failure  and  repair 
completion  instants  and  that  it  suffices  to  consider  only  the  class  of 
deterministic  policies  that  never  leave  repairmen  idle  while  there  are  some 
failed components. A policy is called deterministic if it assigns repairmen to
failed components as a deterministic function of the state of the system only.
Let $\Pi$ denote this finite set of deterministic policies that never leave
repairmen idle while there are some failed components. The information
pattern used by the controller at decision epochs is the state of the system at
those instants. When a policy $\pi \in \Pi$ is employed the time evolution of the
state of the system can be described by a continuous time, finite state,
irreducible Markov chain $\{x^\pi(t) = (x_1^\pi(t), \ldots, x_N^\pi(t)),\ t \ge 0\}$, where $x_i^\pi(t) = x_i$
if the i-th component is in state $x_i$ at time t. Thus, when the initial
state of the system is x and a policy $\pi \in \Pi$ is employed, the average
expected system operation time and the expected discounted system operation
time, for a given discount rate $\beta$, are defined by (1) and (2), respectively.

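(1)  $A_\pi(x) = \lim_{T \to \infty} \frac{1}{T}\, E\left[ \int_0^T \phi(x^\pi(t))\, dt \,\middle|\, x^\pi(0) = x \right]$

(2)  $D_\pi(x,\beta) = E\left[ \int_0^\infty e^{-\beta t} \phi(x^\pi(t))\, dt \,\middle|\, x^\pi(0) = x \right]$

Notice that, for $\pi \in \Pi$, $\{x^\pi(t),\ t \ge 0\}$ is ergodic; thus, we have

(3)  $A_\pi(x) = A(\pi) = \sum_{y \in G} e_\pi(y)$

where $\{e_\pi(x),\ x \in \{0,1\}^N\}$ is the set of ergodic probabilities of
$\{x^\pi(t),\ t \ge 0\}$.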


It will be convenient to work with the quantities $U_\pi(x,\beta)$ which denote
the expected discounted time that the system does not function when the
initial state is x and policy $\pi$ is used. Let $\bar\phi(x) = 1 - \phi(x)$; then
$U_\pi(x,\beta)$ is given by (4) below.

(4)  $U_\pi(x,\beta) = E\left[ \int_0^\infty e^{-\beta t} \bar\phi(x^\pi(t))\, dt \,\middle|\, x^\pi(0) = x \right]$

It is easy to see that $D_\pi(x,\beta) + U_\pi(x,\beta) = 1/\beta$; thus, maximizing $D_\pi(x,\beta)$ is
equivalent to minimizing $U_\pi(x,\beta)$.

Let $R(x) = \min\{R, |C_0(x)|\}$ denote the maximum number of components
that can be under repair when the system is in state x. A deterministic
policy $\pi$ is specified by sets of the form
$\pi(x) = \{\pi_i(x) \in C_0(x),\ 1 \le i \le R(x)\}$, for all states x, where $\pi(x)$
denotes the set of components under
repair when $\pi$ is used and the system is in state x. We assume that
components in $\pi(x)$ are ordered in a consistent way for all x in S, but
for notational simplicity we do not define this order explicitly. Thus, $\pi_i(x)$
denotes a specific failed component to which a repairman is assigned under
$\pi$ when the system is in state x. Under policies in $\Pi$, repairmen are
never left idle when there are enough failed components; thus, $\pi(x) = C_0(x)$
for all states x such that $R \ge |C_0(x)|$. However, for states x such that
$R < |C_0(x)|$ there is a choice for $\pi(x)$, and the problem is to choose these
sets so as to maximize $A(\pi)$ or $D_\pi(x,\beta)$.

Finally, we assume that the system under consideration is coherent, i.e.,
we place the following restrictions on G and B (or equivalently, on $\phi$).
i) If $x \in G$ and $y \ge x$ (i.e., $y_i \ge x_i$, $1 \le i \le N$) then $y \in G$, and if
$x \in B$ and $y \le x$ then $y \in B$. ii) For any component i, $1 \le i \le N$,
there exists a state $x \in G$ such that state $(0_i, x) \in B$, where we use the
notation $(\delta_i, x) = (x_1, \ldots, x_{i-1}, \delta, x_{i+1}, \ldots, x_N)$, $\delta = 0$ or $1$.

We shall use the following notation and terminology from coherent
structure theory; see Barlow and Proschan (1974). For any state $x \in S$ we
define: $C_0(x) = \{i \mid x_i = 0\}$, $C_1(x) = \{i \mid x_i = 1\}$. Given any finite
set C, $|C|$ will denote the number of elements in it. A state $x \in B$ such
that $y \in G$ for any $y \ge x$, $y \ne x$, is called a cut; it corresponds to a
minimal set of components which by failing cause the failure of the system.
The size of a cut x is the number of components in $C_0(x)$.
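
The definitions of this section can be checked by brute force on small systems. The following is a minimal sketch, not part of the original development: states are tuples of 0/1, components are indexed from 0, and the names `with_component`, `is_coherent` and `cuts` are hypothetical helpers introduced here for illustration only.

```python
from itertools import product


def series(x):
    """Structure function of a series system: works iff every component works."""
    return int(all(x))


def parallel(x):
    """Structure function of a parallel system: works iff some component works."""
    return int(any(x))


def with_component(x, i, value):
    """Return the state (value_i, x): x with component i set to value."""
    return x[:i] + (value,) + x[i + 1:]


def failed_set(x):
    """C_0(x): indices (0-based) of the failed components of x."""
    return {i for i, xi in enumerate(x) if xi == 0}


def is_coherent(phi, n):
    """Check restrictions i) (monotonicity) and ii) (every component matters)."""
    states = list(product((0, 1), repeat=n))
    # i) Repairing a single component never turns a good state into a bad one;
    #    single-component steps suffice because the order is transitive.
    for x in states:
        for i in failed_set(x):
            if phi(with_component(x, i, 1)) < phi(x):
                return False
    # ii) For each component there is a good state that its failure makes bad.
    for i in range(n):
        if not any(x[i] == 1 and phi(x) == 1 and phi(with_component(x, i, 0)) == 0
                   for x in states):
            return False
    return True


def cuts(phi, n):
    """Bad states that become good when any one failed component is repaired.

    For a monotone phi this matches the definition of a cut given above.
    """
    return [x for x in product((0, 1), repeat=n)
            if phi(x) == 0
            and all(phi(with_component(x, i, 1)) == 1 for i in failed_set(x))]
```

For a three component series system the cuts are the three states with exactly one failed component, while for the parallel system the only cut is the all-failed state.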

3. Asymptotically Optimal Policies. We first consider the expected discounted
system operation time criterion and then, using the results obtained, we
treat the maximum availability problem.

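By conditioning on the first transition out of state x we obtain that
under a deterministic policy $\pi$ the $U_\pi(x,\beta)$'s, $x \in S$, are the unique
solution to the following system of linear equations

(5)  $U_\pi(x,\beta) = \frac{1}{\lambda_\pi(x) + \rho\mu(x) + \beta} \left[ \bar\phi(x) + \sum_{i=1}^{R(x)} \lambda_{\pi_i(x)} U_\pi((1_{\pi_i(x)}, x), \beta) + \rho \sum_{j=1}^{N} \mu_j x_j U_\pi((0_j, x), \beta) \right]$,

$x \in S$, where we set $\lambda_\pi(x) = \sum_{i=1}^{R(x)} \lambda_{\pi_i(x)}$ and $\mu(x) = \sum_{j=1}^{N} \mu_j x_j$, with the convention that the sum over repairs is empty (equal to 0) when $x = 1$.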
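
For a small system the linear equations (5) can be solved directly. The sketch below is an illustration under stated assumptions rather than the method of this paper: a policy is represented as a hypothetical function mapping a state to the tuple of components under repair, `lowest_index_policy` is an arbitrary example policy, and exact rational arithmetic is used so that results can be compared with hand calculations.

```python
from fractions import Fraction
from itertools import product


def solve_linear(A, b):
    """Solve A u = b by Gauss-Jordan elimination (exact when entries are Fractions)."""
    n = len(b)
    M = [row[:] + [b[k]] for k, row in enumerate(A)]
    for c in range(n):
        p = next(r for r in range(c, n) if M[r][c] != 0)  # pivot row
        M[c], M[p] = M[p], M[c]
        M[c] = [v / M[c][c] for v in M[c]]
        for r in range(n):
            if r != c and M[r][c] != 0:
                f = M[r][c]
                M[r] = [vr - f * vc for vr, vc in zip(M[r], M[c])]
    return [M[k][n] for k in range(n)]


def lowest_index_policy(R):
    """Example policy (an assumption): repair the R failed components of lowest index."""
    def policy(x):
        return tuple(i for i, xi in enumerate(x) if xi == 0)[:R]
    return policy


def discounted_downtime(phi, lam, mu, rho, beta, policy):
    """Return {x: U_pi(x, beta)} by solving the balance equations (5)."""
    n = len(lam)
    states = list(product((0, 1), repeat=n))
    idx = {x: k for k, x in enumerate(states)}
    A = [[Fraction(0)] * len(states) for _ in states]
    b = [Fraction(0)] * len(states)
    for x in states:
        k = idx[x]
        repair = policy(x)
        # Total transition rate out of x plus the discount rate.
        A[k][k] += (sum(lam[i] for i in repair)
                    + rho * sum(mu[j] for j in range(n) if x[j]) + beta)
        for i in repair:                       # repair completions
            A[k][idx[x[:i] + (1,) + x[i + 1:]]] -= lam[i]
        for j in range(n):                     # failures of working components
            if x[j]:
                A[k][idx[x[:j] + (0,) + x[j + 1:]]] -= rho * mu[j]
        b[k] = Fraction(1 - phi(x))            # downtime indicator
    u = solve_linear(A, b)
    return {x: u[idx[x]] for x in states}
```

For a single component with all rates equal to one, the equations reduce to $2U(0) - U(1) = 1$ and $-U(0) + 2U(1) = 0$, which the solver reproduces.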

It is known, Derman (1970), that there exists a deterministic policy $\pi^*$ such
that

(6)  $U_{\pi^*}(x,\beta) \le U_\pi(x,\beta)$  $\forall x \in S$, $\forall \pi \ne \pi^*$.

Furthermore, since the state space is finite, it follows (Derman (1970)) that
there exists a $\beta_0 > 0$ and a deterministic policy $\pi^0$ such that

(7)  $U_{\pi^0}(x,\beta) \le U_\pi(x,\beta)$  $\forall x \in S$, $\forall \pi \ne \pi^0$ and $\forall \beta \in (0, \beta_0)$,

and

(8)  $\lim_{\beta \to 0} \beta\, U_{\pi^0}(x,\beta) = \sum_{y \in B} e_{\pi^0}(y) = \bar A(\pi^0)$,  $\forall x \in S$.

Note that under $\pi^0$ the availability of the system is equal to $1 - \bar A(\pi^0)$.
Thus, it follows from (7) and (8) that if a policy minimizes $U_\pi(x,\beta)$ for
small values of $\beta$, then it maximizes the availability of the system.

We first prove the following

Lemma 1. For any $x \in S$, $\beta \in (0, \infty)$ and $\pi \in \Pi$, there exist power series
expansions of $U_\pi(x,\beta)$, of the form

(9)  $U_\pi(x,\beta) = \sum_{\nu=0}^{\infty} U_\pi^{(\nu)}(x,\beta)\, \rho^\nu$.

Proof. We can write the system of equations (5) in the following form:

(10)  $U_\pi(x) = \frac{1}{\lambda_\pi(x) + \beta} \left[ \bar\phi(x) + \sum_{i=1}^{R(x)} \lambda_{\pi_i(x)} U_\pi((1_{\pi_i(x)}, x)) + \rho \sum_{j=1}^{N} \mu_j x_j \left( U_\pi((0_j, x)) - U_\pi(x) \right) \right]$,

$x \in S$. In matrix form (10) can be written as: $U = a + BU + \rho CU$ where subscripts
and arguments have been suppressed and a, B, C are appropriately defined.
It is obvious that B is triangular so that $(I-B)^{-1}$ exists and may
be computed in a recursive fashion. Thus, $U = (I-B)^{-1}a + \rho (I-B)^{-1}CU$ or in
more compact form:

(11)  $U = b(\pi,\beta) + \rho\, Q(\pi,\beta)\, U$.

It follows from (11) that for any $k \ge 1$, we have:

(12)  $U = b(\pi,\beta) + \sum_{\nu=1}^{k} \rho^\nu (Q(\pi,\beta))^\nu b(\pi,\beta) + \rho^{k+1} (Q(\pi,\beta))^{k+1} U$.

Now let $\|Q(\pi,\beta)\|$ denote a norm of the matrix $Q(\pi,\beta)$; then (12) implies
that (9) holds for all $\rho \in (0, 1/\|Q(\pi,\beta)\|)$.


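The next corollary provides a method for computing the coefficients
$U_\pi^{(\nu)}(x,\beta)$ recursively for increasing $|C_0(x)|$. For notational simplicity the
sum over repairs is taken to be empty (equal to 0) when $x = 1$.

Corollary 1. For any $x \in S$, $\beta \in (0,\infty)$ and $\pi \in \Pi$, the $U_\pi^{(\nu)}(x,\beta)$'s can
be computed recursively from equations (13), (14) below

(13a)  $U_\pi^{(0)}(1,\beta) = 0$,

(13b)  $U_\pi^{(0)}(x,\beta) = \frac{1}{\lambda_\pi(x) + \beta} \left[ \bar\phi(x) + \sum_{i=1}^{R(x)} \lambda_{\pi_i(x)} U_\pi^{(0)}((1_{\pi_i(x)}, x), \beta) \right]$,

(14)  $U_\pi^{(\nu+1)}(x,\beta) = \frac{1}{\lambda_\pi(x) + \beta} \left[ \sum_{i=1}^{R(x)} \lambda_{\pi_i(x)} U_\pi^{(\nu+1)}((1_{\pi_i(x)}, x), \beta) + \sum_{j=1}^{N} \mu_j x_j \left( U_\pi^{(\nu)}((0_j, x), \beta) - U_\pi^{(\nu)}(x,\beta) \right) \right]$,

$\nu \ge 0$, $x \in S$.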

It follows from (11) that $U^{(0)} = (I - B)^{-1}a$ and since B is triangular,
$U^{(0)}$ can be computed recursively by $U^{(0)} = a + BU^{(0)}$ which is (13).

Similarly, $U^{(\nu+1)} = (I - B)^{-1}CU^{(\nu)}$; thus, $U^{(\nu+1)} = BU^{(\nu+1)} + CU^{(\nu)}$ which is
(14).

Remark 1. Note that equations (13), (14) constitute a Gauss-Seidel iteration
method for solving the system of linear equations (5) for a specific choice of
initial points. Thus the overall approach of determining policies that minimize
the leading coefficients is essentially equivalent to employing a so-called
pre-Gauss-Seidel iteration for the Markovian decision problem under
consideration; see Thomas et al. (1984) and references given there.
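
The recursion (13), (14) translates almost line by line into a memoized function. The sketch below is illustrative only: the policy representation (a function from a state to the tuple of components under repair), the helper `lowest_index_policy` and the function name `coefficient_function` are assumptions made here, not notation from the paper.

```python
from fractions import Fraction
from functools import lru_cache


def parallel(x):
    """Structure function of a parallel system (example input)."""
    return int(any(x))


def lowest_index_policy(R):
    """Example policy (an assumption): repair the R failed components of lowest index."""
    def policy(x):
        return tuple(i for i, xi in enumerate(x) if xi == 0)[:R]
    return policy


def coefficient_function(phi, lam, mu, beta, policy):
    """Return coef(v, x), the coefficient of rho**v in the expansion (9) of U_pi(x, beta)."""
    n = len(lam)

    @lru_cache(maxsize=None)
    def coef(v, x):
        repair = policy(x)
        rate = sum(lam[i] for i in repair) + beta
        # Repair terms: same order v, one fewer failed component.
        acc = sum(lam[i] * coef(v, x[:i] + (1,) + x[i + 1:]) for i in repair)
        if v == 0:
            acc += 1 - phi(x)                  # (13): downtime indicator
        else:
            # (14): failure terms use the coefficients of order v - 1.
            acc += sum(mu[j] * (coef(v - 1, x[:j] + (0,) + x[j + 1:]) - coef(v - 1, x))
                       for j in range(n) if x[j])
        return Fraction(acc) / rate

    return coef
```

For a two component parallel system with all rates equal to one, the coefficients of orders 0 and 1 at the state with both components working vanish, and the first nonzero coefficient appears at order 2, the size of the only cut.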

We  next  aim  to  determine  the  leading  coefficients  of  the  power  series 
(9). 

We first need to define the following quantities. Let
$m(\phi) = \min\{|C_0(x)| : x \in B\}$, $B_{m(\phi)} = \{x \in B : |C_0(x)| = m(\phi)\}$ and
$I(x) = \min\{|C_0(y)| : y \le x,\ y \in B\} - |C_0(x)|$. In the terminology of coherent
structure theory, $m(\phi)$ is the size of a cut state of minimal size,
$B_{m(\phi)}$ is the set of all such states and $I(x)$ is the minimum number of components
that must fail when the system is in state x, in order to cause a
system failure. The next lemma summarizes properties of $I(x)$ that are easily
verifiable from its definition.

Lemma 2. For any state x the following are true.

(i) $I(x) \le m(\phi) = I(1)$.

(ii) $I((0_i, x)) \ge I(x) - 1$, $\forall i \in C_1(x)$.

(iii) If $\phi(x) = 1$ then $I(x) \ge 1$.

(iv) If $y \in B_{m(\phi)}$, $j_i \in C_0(y)$, $i = 1, 2, \ldots, \nu$ and $x = (1_{j_1}, \ldots, 1_{j_\nu}, y)$ then
$I(x) = \nu$.
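
The quantity $I(x)$ and the properties in Lemma 2 can be checked by enumeration on small examples. The following is a minimal sketch under the assumption that the structure function is coherent, so that the all-failed state is bad; the function names are hypothetical and the example system (two parallel pairs connected in series) is chosen here only for illustration.

```python
from itertools import product


def two_pairs(x):
    """Two parallel pairs in series: components (0, 1) and (2, 3)."""
    return int((x[0] or x[1]) and (x[2] or x[3]))


def distance_to_failure(phi, n):
    """Return {x: I(x)}, the fewest additional failures that bring the system down."""
    states = list(product((0, 1), repeat=n))
    bad = [y for y in states if phi(y) == 0]
    dist = {}
    for x in states:
        # Bad states y <= x, i.e. reachable from x by failures only.
        reachable = [y.count(0) for y in bad
                     if all(yi <= xi for yi, xi in zip(y, x))]
        dist[x] = min(reachable) - x.count(0)
    return dist
```

Here `distance_to_failure(two_pairs, 4)` assigns the value $m(\phi) = 2$ to the state with all components working, and 0 to every failed state.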

We can now prove the following

Lemma 3. For any $x \in S$, any $\pi \in \Pi$ and for any $k = 0, 1, \ldots, m(\phi) - 1$, if
$I(x) \ge k + 1$ then $U_\pi^{(k)}(x,\beta) = 0$.

Proof. By induction on k and subinduction on $|C_0(x)|$.

i) For $k = 0$, i.e. $I(x) \ge 1$, we have:

a) if $|C_0(x)| = 0$ then $x = 1$ and therefore $U_\pi^{(0)}(1,\beta) = 0$ by (13a),

b) assume that the lemma holds for all x such that $|C_0(x)| = \nu$. Then
(13), the induction hypothesis, and the observation that $\bar\phi(x) = 0$ when
$I(x) \ge 1$ and $|C_0((1_{\pi_i(x)}, x))| = \nu$ when $|C_0(x)| = \nu + 1$, imply that the lemma
holds for all x such that $|C_0(x)| = \nu + 1$.

ii) Assume that the lemma is true for $k = k_0 < m(\phi) - 1$. We next show that
it holds for $k = k_0 + 1$. Then:

a) If $|C_0(x)| = 0$, i.e., $x = 1$, then (14) becomes:

$U_\pi^{(k_0+1)}(1,\beta) = \frac{1}{\beta} \left[ \sum_{j=1}^{N} \mu_j \left( U_\pi^{(k_0)}((0_j, 1), \beta) - U_\pi^{(k_0)}(1,\beta) \right) \right]$.

Note now that $I((0_j, 1)) \ge I(1) - 1 > k_0$, by Lemma 2 (ii). Thus, the result
follows since by the induction hypothesis (ii) we have that:
$U_\pi^{(k_0)}((0_j, 1), \beta) = U_\pi^{(k_0)}(1,\beta) = 0$.

b) Assume that the lemma holds for any x such that $|C_0(x)| = \nu$.
Consider a state x such that $|C_0(x)| = \nu + 1$. Then $|C_0((1_{\pi_i(x)}, x))| = \nu$,
and the induction hypothesis (b) implies that: $U_\pi^{(k_0+1)}((1_{\pi_i(x)}, x), \beta) = 0$.
Notice also that $I((0_j, x)) > k_0$ and $I(x) > k_0 + 1 > k_0$; thus induction hypothesis
(ii) implies that $U_\pi^{(k_0)}((0_j, x), \beta) = U_\pi^{(k_0)}(x,\beta) = 0$. Now it is easy to
complete the induction step using (14).

A consequence of Lemmas 1 and 3 is the next

Theorem 1. For any $x \in S$ and any $\pi \in \Pi$, there exist constants $U_\pi^{(I(x))}(x,\beta)$
in $(0,\infty)$, such that:

(15)  $U_\pi(x,\beta) = U_\pi^{(I(x))}(x,\beta)\, \rho^{I(x)} + o(\rho^{I(x)})$

where the $U_\pi^{(I(x))}(x,\beta)$'s can be determined recursively as follows.

i) For all states x such that $I(x) = 0$,

(16)  $U_\pi^{(0)}(x,\beta) = \frac{1}{\lambda_\pi(x) + \beta} \left[ \bar\phi(x) + \sum_{i=1}^{R(x)} \lambda_{\pi_i(x)} U_\pi^{(0)}((1_{\pi_i(x)}, x), \beta) \right]$.

ii) For all x such that $I(x) \ge 1$,

(17)  $U_\pi^{(I(x))}(x,\beta) = \frac{1}{\lambda_\pi(x) + \beta} \left[ \sum_{i=1}^{R(x)} \lambda_{\pi_i(x)} U_\pi^{(I(x))}((1_{\pi_i(x)}, x), \beta) + \sum_{j=1}^{N} \mu_j x_j U_\pi^{(I(x)-1)}((0_j, x), \beta) \right]$.

Proof. Equations (16) and (17) follow from Corollary 1 and Lemma 3. To show
that $U_\pi^{(I(x))}(x,\beta) > 0$ for all x, notice that this is true for all x such
that $I(x) = 0$. The proof can be completed by induction on $I(x)$ using (17).

Theorem 1 shows that the order of the leading terms in the asymptotic
power series expansion of $U_\pi(x,\beta)$ is the same for all deterministic
policies. Thus, we can formally state the following.

Proposition 1. A policy $\pi^* \in \Pi$ maximizes the expected total discounted system
operation time for small values of $\rho$ if and only if:

(18)  $U_{\pi^*}^{(I(x))}(x,\beta) = \min\{ U_\pi^{(I(x))}(x,\beta),\ \pi \in \Pi \}$,  $\forall x \in S$.

In the absence of ties (18) determines unique asymptotically optimal actions
for all states. Ties can be resolved by computing and minimizing higher
order coefficients subject to minimization of all lower order coefficients.

Remark 2. Since $U_\pi^{(k)}(1,\beta) = 0$ for $k = 0, 1, \ldots, I(1) - 1$, the coefficients
$U_\pi^{(k)}(x,\beta)$, for all $x \ne 1$, $k \le I(1) - 1$, are the same as
those in the asymptotic power series expansion of the expected discounted time
that the system spends in failed states during the first passage time
from state x to state 1 under policy $\pi$. Thus, since $I(x) \le I(1) - 1$
for all $x \ne 1$, we have the following partial characterization of asymptotically
optimal policies. If a policy is asymptotically optimal with respect
to the expected discounted system operation time criterion, then it must
assign repairmen to failed components in such a way that the expected discounted
time that the system spends in failed states during the first
passage time from state x to state 1 is minimized.


We now turn to the problem of determining $\pi \in \Pi$ to maximize the
availability of the system. We have seen that for this problem it suffices
to determine policies that minimize the leading coefficients $U_\pi^{(\nu)}(x,\beta)$ of
the power series (9) for small values of the discount rate $\beta$. Thus, let
$V_\pi^{(\nu)}(x) = \lim_{\beta \to 0} U_\pi^{(\nu)}(x,\beta)$, for $x \ne 1$ and $\nu \le I(1) - 1$. Then, a policy $\pi$
minimizes $U_\pi^{(\nu)}(x,\beta)$ for small values of $\beta$ if and only if it
minimizes $V_\pi^{(\nu)}(x)$. Now, using Lemma 3 and Theorem 1 we obtain the following
procedure for computing and minimizing the $V_\pi^{(\nu)}(x)$'s directly.


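Theorem 2. For any $x \in S$, $x \ne 1$ and any $\pi \in \Pi$, we have

(19)  $V_\pi^{(\nu)}(x) = 0$ for $\nu \le I(x) - 1$,

and the $V_\pi^{(I(x))}(x)$'s can be determined recursively as follows.

i) For all states x such that $I(x) = 0$,

(20)  $V_\pi^{(0)}(x) = \frac{1}{\lambda_\pi(x)} \left[ \bar\phi(x) + \sum_{i=1}^{R(x)} \lambda_{\pi_i(x)} V_\pi^{(0)}((1_{\pi_i(x)}, x)) \right]$.

ii) For all x such that $I(x) \ge 1$,

(21)  $V_\pi^{(I(x))}(x) = \frac{1}{\lambda_\pi(x)} \left[ \sum_{i=1}^{R(x)} \lambda_{\pi_i(x)} V_\pi^{(I(x))}((1_{\pi_i(x)}, x)) + \sum_{j=1}^{N} \mu_j x_j V_\pi^{(I(x)-1)}((0_j, x)) \right]$.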

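iii) For $I(x) < \nu \le I(1) - 1$,

(22)  $V_\pi^{(\nu)}(x) = \frac{1}{\lambda_\pi(x)} \left[ \sum_{i=1}^{R(x)} \lambda_{\pi_i(x)} V_\pi^{(\nu)}((1_{\pi_i(x)}, x)) + \sum_{j=1}^{N} \mu_j x_j \left( V_\pi^{(\nu-1)}((0_j, x)) - V_\pi^{(\nu-1)}(x) \right) \right]$.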


Thus we can now formally state the following

Proposition 2. A policy $\pi^0 \in \Pi$ maximizes the availability of the system, for
small values of $\rho$ if and only if:

(23)  $V_{\pi^0}^{(I(x))}(x) = \min\{ V_\pi^{(I(x))}(x),\ \pi \in \Pi \}$,  $\forall x \in S$, $x \ne 1$.

In the absence of ties, (23) determines a unique asymptotically optimal
policy. Ties can be resolved by considering higher order coefficients as
computed by (22).

Remark 3. Notice that when the system is in a failed state x, $I(x) = 0$
and the $V_\pi^{(0)}(x)$ as determined from equations (20) is the expected time until
the system is back in operation, in the absence of failures. Thus, we obtain
the following, intuitively expected, partial characterization of asymptotically
optimal policies. If a policy $\pi^0$ maximizes the availability of the system, for
small values of $\rho$, then when the system is failed, $\pi^0$ must assign
repairmen to failed components in such a way that the expected time until the
system is back in operation, in the absence of failures, is minimized.

In  the  next  theorem  we  show  that  asymptotically  optimal  policies  are 
strictly  optimal  when  all  failure  rates  are  sufficiently  small. 

Theorem 3. Let $\pi^*$ be an asymptotically optimal policy, with respect to one
of the criteria that have been considered. Then, there exists a $\rho_0 > 0$ such
that $\pi^*$ is optimal $\forall \rho \in (0, \rho_0)$.

Proof. We prove the theorem for the expected discounted operation time criterion
only. The proof for the maximum availability criterion is similar and
is omitted.

Recall that for any policy $\pi \in \Pi$ and for $\rho \in (0, 1/\|Q(\pi,\beta)\|)$ the
$U_\pi(x,\beta)$'s possess convergent power series expansions. Since there are
finitely many policies in $\Pi$, it follows that the above power series representations
of all $U_\pi(x,\beta)$'s are convergent for all $\pi \in \Pi$ in the interval
$(0, \rho_1)$, where $\rho_1 = \min_{\pi} \{ 1/\|Q(\pi,\beta)\| \}$.

Now for any $x \in S$ and $\pi_1, \pi_2 \in \Pi$, it follows (see Rudin (1976, pp.
177)) that the difference $U_{\pi_1}(x,\beta) - U_{\pi_2}(x,\beta)$ may change sign only a finite
number of times. Thus the theorem follows from Proposition 2 and the fact
that there are finitely many policies in $\Pi$ and states in S.

4. Applications. In the following examples we restrict our attention to determining
policies which are asymptotically optimal with respect to the availability
criterion.

4.1 Series and Parallel Systems. Consider first the N component series
system maintained by R repairmen. The only functioning state is state
$1 = (1, \ldots, 1)$. From Proposition 2 and Remark 3 we know that an asymptotically
optimal policy $\pi^*$ minimizes the expected time to state 1 from any initial
state x in the absence of failures. Thus for any initial state x, in the
terminology of stochastic scheduling, an asymptotically optimal policy minimizes
the expected makespan for allocating $|C_0(x)|$ tasks (repairs) on R
identical processors (repairmen). For $R = 1$ all policies in $\Pi$ have the
same makespan: $V_\pi^{(0)}(x) = \sum_{i \in C_0(x)} 1/\lambda_i$. For $R \ge 2$ it has been shown by
Bruno et al. (1981) that an optimal policy assigns repairmen to failed
components in $C_0(x)$ according to the LEPT (Longest Expected Processing Time
First) rule. In the context of the series system an asymptotically optimal
policy assigns repairmen to the failed components with the longest expected
repair times. Notice now that this LEPT policy is optimal for sufficiently
small failure rates (Theorem 3). It follows from this example that in the
general case the optimal policy does depend on the repair rates and therefore
the result established in Katehakis and Derman (1984) does not hold for $R \ge 2$.
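
The makespan comparison for the series system can be reproduced numerically on a small instance. The sketch below is an illustration only: it evaluates recursion (20) with $\bar\phi = 1$ on every state with a failed component, represents a state by the tuple of failed component indices, and compares the LEPT choice against the best choice at every state. The function name `expected_makespan` and the rule labels `"lept"` and `"best"` are assumptions made here.

```python
from fractions import Fraction
from functools import lru_cache
from itertools import combinations


def expected_makespan(lam, R):
    """Return value(failed, rule): expected time to repair all components in
    `failed` with R repairmen and no new failures, under the given rule."""

    @lru_cache(maxsize=None)
    def value(failed, rule):
        if not failed:
            return Fraction(0)
        k = min(R, len(failed))
        if rule == "lept":
            # Longest expected repair time first = smallest repair rates first.
            choices = [tuple(sorted(failed, key=lambda i: lam[i])[:k])]
        else:
            # "best": minimize over every set of k failed components.
            choices = list(combinations(failed, k))
        best = None
        for chosen in choices:
            rate = sum(lam[i] for i in chosen)
            # One expected sojourn (1 / rate) plus the expected remaining time.
            v = (1 + sum(lam[i] * value(tuple(j for j in failed if j != i), rule)
                         for i in chosen)) / Fraction(rate)
            if best is None or v < best:
                best = v
        return best

    return value
```

With repair rates (1, 2, 3) and two repairmen, calling `expected_makespan((1, 2, 3), 2)` and evaluating the result at `(0, 1, 2)` under both rules gives the same value, in agreement with the LEPT result quoted above; with a single repairman the value is the sum of the expected repair times regardless of the order.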


For the parallel system the only failed state is state $0 = (0, \ldots, 0)$.
Furthermore, $I((1_j, x)) = I(x) + 1$ for all $x \ne 1$, $j \in C_0(x)$; thus it is easy to show,
using Theorem 2, that the policy which always assigns repairmen to the failed
components with the largest repair rates is asymptotically optimal.

4.2 Parallel Subsystems Connected in Series. Consider a system that is composed
of K subsystems and is maintained by a single repairman. The i-th
subsystem is composed of $N_i$ components with the same failure rate $\mu_i$.
Furthermore, we assume that all components have identical repair rates $\lambda$.

Since the subsystems have identical components, it is easy to see that
the state of the system at any time can be adequately specified by a vector
$z = (z_1, \ldots, z_K)$, where $z_i$ denotes the number of functioning components in
subsystem i, $z_i = 0, 1, \ldots, N_i$. The structure of the system is specified by
the sets $G = \{z \mid z_i \ge 1\ \forall i\}$, $B = \{z \mid z_i = 0 \text{ for some } i\}$. Let
$C_0(z) = \{i \mid z_i = 0\}$, $(n_j, z) = (z_1, \ldots, z_{j-1}, n, z_{j+1}, \ldots, z_K)$, $n = 0, 1, \ldots, N_j$,
and define $(n_{j_1}, \ldots, n_{j_k}, z)$ recursively by $(n_{j_1}, n_{j_2}, z) = (n_{j_1}, (n_{j_2}, z))$.

Note that $I(z) = \min\{z_i,\ i = 1, \ldots, K\}$. Finally let $\mathcal{J}(z) = \{i \mid z_i = I(z)\}$
and $J((n_{j_1}, \ldots, n_{j_k}, z)) = \min\{z_i,\ i \ne j_1, \ldots, j_k\}$. Since subsystems have
identical components a policy is specified only up to the subsystem to which
the repairman is assigned. With this generalization of notation, using
Theorem 2, we obtain that the asymptotically optimal policy is given by the
following simple rule.

i) If $z \in G$ assign the repairman to subsystem j if and only if either
$\mathcal{J}(z) = \{j\}$ or $\mu_j = \max\{\mu_i,\ i \in \mathcal{J}(z)\}$.

ii) If $z \in B$ assign the repairman to subsystem j if and only if
$\mu_j = \min\{\mu_i,\ i \in C_0(z)\}$.
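
The rule i), ii) can be written as a short selection function. This is a sketch under stated assumptions: subsystems are indexed from 0, the function name `allowed_subsystems` is hypothetical, and the function does not check that a selected subsystem actually contains a failed component, since the rule above is stated without that boundary case.

```python
def allowed_subsystems(z, mu):
    """Subsystems to which rule i)/ii) allows the single repairman to be assigned.

    z[i]  : number of functioning components in subsystem i
    mu[i] : common failure rate of the components of subsystem i
    """
    if all(zi >= 1 for zi in z):
        # z in G: candidates are the subsystems closest to failure, J(z);
        # among them pick the largest failure rate.
        lowest = min(z)
        candidates = [i for i, zi in enumerate(z) if zi == lowest]
        target = max(mu[i] for i in candidates)
    else:
        # z in B: candidates are the failed subsystems, C_0(z);
        # among them pick the smallest failure rate.
        candidates = [i for i, zi in enumerate(z) if zi == 0]
        target = min(mu[i] for i in candidates)
    return [i for i in candidates if mu[i] == target]
```

A list with more than one entry corresponds to a tie, which the rule leaves unresolved.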

The  proof  of  i)  essentially  involves  establishing  by  induction  on  n  that 


The proof of ii) is easy to complete by induction on decreasing $|\mathcal{J}(z)|$.